Explicitly disentangling image content from translation and rotation with spatial-VAE
Given an image dataset, we are often interested in finding data generative factors that encode semantic content independently from pose variables such as rotation and translation. However, current disentanglement approaches do not impose any specific structure on the learned latent representations. We propose a method for explicitly disentangling image rotation and translation from other unstructured latent factors in a variational autoencoder (VAE) framework. By formulating the generative model as a function of the spatial coordinate, we make the reconstruction error differentiable with respect to latent translation and rotation parameters. This formulation allows us to train a neural network to perform approximate inference on these latent variables while explicitly constraining them to only represent rotation and translation. We demonstrate that this framework, termed spatial-VAE, effectively learns latent representations that disentangle image rotation and translation from content and improves reconstruction over standard VAEs on several benchmark datasets, including applications to modeling continuous 2-D views of proteins from single particle electron microscopy and galaxies in astronomical images.
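The coordinate-based formulation can be sketched in a few lines: a toy decoder maps each (x, y) pixel coordinate to an intensity, and rotation/translation act on the coordinates before decoding, so the decoder only ever models untransformed content. The tiny random MLP below is a stand-in for illustration, not the paper's architecture.

```python
import numpy as np

def coordinate_grid(n):
    # n x n grid of (x, y) coordinates in [-1, 1], one row per pixel
    lin = np.linspace(-1.0, 1.0, n)
    xx, yy = np.meshgrid(lin, lin)
    return np.stack([xx.ravel(), yy.ravel()], axis=1)  # (n*n, 2)

def transform(coords, theta, t):
    # Rotate each coordinate by theta and translate by t before decoding.
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, -s], [s, c]])
    return coords @ R.T + t

def decode(coords, W1, b1, W2, b2):
    # Toy MLP decoder: pixel intensity as a smooth function of its coordinate.
    h = np.tanh(coords @ W1 + b1)
    return h @ W2 + b2  # (n*n, 1)

rng = np.random.default_rng(0)
n = 8
W1, b1 = rng.normal(size=(2, 16)), np.zeros(16)
W2, b2 = rng.normal(size=(16, 1)), np.zeros(1)

grid = coordinate_grid(n)
img_0 = decode(transform(grid, 0.0, np.zeros(2)), W1, b1, W2, b2).reshape(n, n)
img_r = decode(transform(grid, np.pi / 2, np.zeros(2)), W1, b1, W2, b2).reshape(n, n)

# Rotating the coordinate grid by 90 degrees reproduces the rotated image,
# since the decoder acts pointwise on coordinates.
match = np.allclose(np.rot90(img_0), img_r)
print(match)
```

Because the transform is applied to continuous coordinates rather than to pixels, the reconstruction error is a smooth function of the latent rotation angle and translation vector, which is what makes gradient-based inference over them possible.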
Unsupervised Object Representation Learning using Translation and Rotation Group Equivariant VAE
In many imaging modalities, objects of interest can occur in a variety of locations and poses (i.e. are subject to translations and rotations in 2d or 3d), but the location and pose of an object does not change its semantics (i.e. the object's essence). That is, the specific location and rotation of an airplane in satellite imagery, or the 3d rotation of a chair in a natural image, or the rotation of a particle in a cryo-electron micrograph, do not change the intrinsic nature of those objects. Here, we consider the problem of learning semantic representations of objects that are invariant to pose and location in a fully unsupervised manner. We address shortcomings in previous approaches to this problem by introducing TARGET-VAE, a translation and rotation group-equivariant variational autoencoder framework.
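One way to see why group structure yields pose-invariant semantics: pooling any feature over a finite rotation group produces a rotation-invariant code. Below is a minimal sketch for the C4 group (90-degree rotations); the feature function is a hypothetical stand-in, not TARGET-VAE's encoder.

```python
import numpy as np

def features(img):
    # A deliberately non-invariant toy feature: a row-weighted sum plus one pixel.
    n = img.shape[0]
    mask = np.outer(np.arange(n), np.ones(n))  # weights grow down the rows
    return np.array([(img * mask).sum(), img[0, 0]])

def c4_invariant_features(img):
    # Averaging the feature over all four 90-degree rotations of the input
    # yields a code that is invariant to C4 rotations of the object.
    return np.mean([features(np.rot90(img, k)) for k in range(4)], axis=0)

rng = np.random.default_rng(1)
img = rng.normal(size=(6, 6))
inv_a = c4_invariant_features(img)
inv_b = c4_invariant_features(np.rot90(img))  # same object, rotated 90 degrees

same = np.allclose(inv_a, inv_b)
print(same)  # the pooled code ignores C4 rotations
```

The raw feature changes under rotation, but the group-pooled feature does not; equivariant networks generalize this idea from a 4-element discrete group toward continuous rotations and translations.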
Interpreting Representation Quality of DNNs for 3D Point Cloud Processing: Supplementary Materials
Wen Shen
This section provides more details about the Shapley values in Section 3 of the paper. Efficiency: the overall reward can be allocated to all players in the game, i.e., the Shapley values of all players sum to the total reward. Efficiency (for interactions): the overall reward can be decomposed into interactions of different orders. (This work was done when Wen Shen was an intern at Shanghai Jiao Tong University. Quanshi Zhang is the corresponding author. This study was done under the supervision of Dr. Quanshi Zhang.)
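The efficiency axiom can be checked numerically. The sketch below computes exact Shapley values for a toy three-player game by enumerating all coalitions; the game function v here is a made-up example, not one from the paper.

```python
from itertools import combinations
from math import factorial

def shapley_values(players, v):
    # Exact Shapley value by coalition enumeration:
    # phi(i) = sum over S subset of N\{i} of
    #          |S|! (n-|S|-1)! / n! * (v(S + {i}) - v(S))
    n = len(players)
    phi = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for k in range(n):
            for S in combinations(others, k):
                w = factorial(k) * factorial(n - k - 1) / factorial(n)
                total += w * (v(set(S) | {i}) - v(set(S)))
        phi[i] = total
    return phi

# Toy cooperative game: reward grows superadditively with coalition size.
def v(S):
    return len(S) ** 2

players = [1, 2, 3]
phi = shapley_values(players, v)
# Efficiency axiom: the attributions exactly allocate v(N) - v(empty set).
print(sum(phi.values()), v(set(players)) - v(set()))
```

By symmetry each player receives an equal share here, and the shares sum exactly to the overall reward, which is the efficiency property stated above.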
Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation
Zhujun Li, Shuo Zhang, Ioannis Stamos
Category-level object pose estimation aims to predict the 6D pose and 3D size of objects within given categories. Existing approaches for this task rely solely on 6D poses as supervisory signals without explicitly capturing the intrinsic continuity of poses, leading to inconsistencies in predictions and reduced generalization to unseen poses. To address this limitation, we propose HRC-Pose, a novel depth-only framework for category-level object pose estimation, which leverages contrastive learning to learn point cloud representations that preserve the continuity of 6D poses. HRC-Pose decouples object pose into rotation and translation components, which are separately encoded and leveraged throughout the network. Specifically, we introduce a contrastive learning strategy for multi-task, multi-category scenarios based on our 6D pose-aware hierarchical ranking scheme, which contrasts point clouds from multiple categories by considering rotational and translational differences as well as categorical information. We further design pose estimation modules that separately process the learned rotation-aware and translation-aware embeddings. Our experiments demonstrate that HRC-Pose successfully learns continuous feature spaces. Results on the REAL275 and CAMERA25 benchmarks show that our method consistently outperforms existing depth-only state-of-the-art methods and runs in real time, demonstrating its effectiveness and potential for real-world applications.
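The "rotational and translational differences" used for ranking can be made concrete with standard pose distances: the geodesic angle on SO(3) for rotations and the Euclidean norm for translations. This is a sketch of plausible distance functions under those standard definitions, not HRC-Pose's exact formulation.

```python
import numpy as np

def rotation_distance(R1, R2):
    # Geodesic distance on SO(3): the angle of the relative rotation R1^T R2,
    # recovered from its trace; clip guards against floating-point drift.
    cos_angle = (np.trace(R1.T @ R2) - 1.0) / 2.0
    return np.arccos(np.clip(cos_angle, -1.0, 1.0))

def translation_distance(t1, t2):
    # Plain Euclidean distance between translation vectors.
    return np.linalg.norm(t1 - t2)

def rot_z(theta):
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

# Two poses differing by a 30-degree rotation about z and a small shift.
R_a, t_a = rot_z(0.0), np.array([0.0, 0.0, 0.5])
R_b, t_b = rot_z(np.pi / 6), np.array([0.1, 0.0, 0.5])

print(rotation_distance(R_a, R_b))    # ~ pi/6
print(translation_distance(t_a, t_b))
```

Treating the two distances separately mirrors the decoupling described above: rotation-aware and translation-aware embeddings can each be ranked by their own metric rather than by a single mixed pose distance.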
Unsupervised Object Representation Learning using Translation and Rotation Group Equivariant VAE (Supplementary Material)
A.1 Calculating Kullback-Leibler divergence

Based on the standard definition of the KL divergence, we expand KL(q(z, θ, t, r | y) || p(z, θ, t, r)).

We generated two datasets, MNIST(N) and MNIST(U), by rotating and translating the digits in MNIST. Images in both datasets are 50×50 pixels.

A.3 Digit-wise rotation correlation and RMSE of the predicted rotations

We created a new dataset using multiple rotated and translated digits from MNIST(U). Some predicted rotations for digits 0, 1, and 8 are off by π from their ground-truth values. We find that the model correctly identifies and reconstructs the objects (Figure 3).
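For the unstructured latent z with a diagonal-Gaussian posterior and a standard-normal prior, the KL term has the familiar closed form used in standard VAEs. The sketch below assumes that common case only; it does not reproduce the paper's full factorization over (z, θ, t, r).

```python
import numpy as np

def kl_diag_gaussian_vs_standard_normal(mu, log_var):
    # KL( N(mu, diag(exp(log_var))) || N(0, I) )
    # = 0.5 * sum( exp(log_var) + mu^2 - 1 - log_var )
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var)

# When the posterior equals the prior, the divergence is exactly zero.
print(kl_diag_gaussian_vs_standard_normal(np.zeros(2), np.zeros(2)))
# Shifting one posterior mean to 1 contributes 0.5 * mu^2 = 0.5.
print(kl_diag_gaussian_vs_standard_normal(np.array([1.0, 0.0]), np.zeros(2)))
```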